A Hybrid GeneticMax Algorithm for Improving the Traditional Genetic Based Approach for Mining Maximal Frequent Item Sets
نویسندگان
چکیده
Mining Frequent item sets is one of the most useful data mining methods which discovers important relationships among attributes of data sets. Initially it was developed for market basket analysis, but these days it is used to solve any task where discovering hidden relationships among different attributes is required. Mining frequent item sets plays a vital role for generating association rules, finding correlations and many more interesting relationships among different sort of data. A major challenge in the frequent item set mining task is that it generates a huge number of frequent sub item sets from dense data sets. Researchers proposed mining maximal frequent item sets to overcome this problem. Maximal frequent item sets contain the information of an exponential number of frequent sub item sets since if an item set is frequent each of its sub item sets is also frequent. Very few studies have applied evolutionary algorithms to mine maximal frequent item sets using thorough experimental analysis. In a previous study, we showed the efficiency of using a genetic based approach named GeneticMax to find maximal frequent item sets. In this study we will introduce a new algorithm name, hybrid GeneticMax, which uses local search along with a genetic algorithm to mine maximal frequent item sets from large data sets. The purpose of using the genetic algorithm is that this algorithm based approach is robust and the existing genetic based method which is working fine for a specific problem can be improved by hybridizing it. Experiments are performed on different real world data sets as well as on a synthetic data set. Our new scheme compared favorably to existing GeneticMax under certain conditions which are being evaluated.
منابع مشابه
A Novel Approach for finding Frequent Item Sets with Hybrid Strategies
Frequent item sets mining plays an important role in association rules mining. Over the years, a variety of algorithms for finding frequent item sets in very large transaction databases have been developed. Therefore, a number of methods have been proposed recently to discover approximate frequent item sets. This paper proposes an efficient SMine (Sorted Mine) Algorithm for finding frequent ite...
متن کاملIndexed Enhancement on GenMax Algorithm for Fast and Less Memory Utilized Pruning of MFI and CFI
The essential problem in many data mining applications is mining frequent item sets such as the discovery of association rules, patterns, and many other important discovery tasks. Fast and less memory utilization for solving the problems of frequent item sets are highly required in transactional databases. Methods for mining frequent item sets have been implemented using a prefix-tree structure...
متن کاملFUZZY GRAVITATIONAL SEARCH ALGORITHM AN APPROACH FOR DATA MINING
The concept of intelligently controlling the search process of gravitational search algorithm (GSA) is introduced to develop a novel data mining technique. The proposed method is called fuzzy GSA miner (FGSA-miner). At first a fuzzy controller is designed for adaptively controlling the gravitational coefficient and the number of effective objects, as two important parameters which play major ro...
متن کاملA Novel Approach to Mining Maximal Frequent Itemsets Based on Genetic Algorithm
We present a new approach based on Genetic Algorithm to generate maximal frequent itemsets from large databases. This new algorithm called GeneticMax is heuristic which mimics natural selection approaches to finding maximal frequent itemsets in an efficient way. The search strategy of this algorithm uses lexicographic tree that avoids level by level searching, which finally reduces the time req...
متن کاملAn Efficient Algorithm for Mining Maximal Frequent Item Sets
Problem Statement: In today’s life, the mining of frequent patterns is a basic problem in data mining applications. The algorithms which are used to generate these frequent patterns must perform efficiently. The objective was to propose an effective algorithm which generates frequent patterns in less time. Approach: We proposed an algorithm which was based on hashing technique and combines a ve...
متن کامل